Masked Autoencoders Are Scalable Vision Learners

Figure: MAE structure.
Figure: MAE qualitative results.
Figure: MAE qualitative results, with a pre-training mask ratio lower than the mask ratio used at inference.